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ABSTRACT 


This  paper  discusses  a  distributed  version  of  Wald’s  sequential  hypothesis  testing  problem  in  the  con¬ 
tinuous  time  framework.  For  sake  of  concreteness,  two  decision-makers  equipped  with  their  own  sensors,  are 
faced  with  the  following  hypothesis  testing  problem  :  Decide  between  hypothesis  Ho  and  Hi,  where 

Under  Hi  :  dX\  =  Hi  dt  +  cr,  dWl,  i  —  1,  2 

Under  H0  :  dX\  =  <n  dW't,  i  =  1,  2, 

with  m  /  0  and  <rt-  ^  0,  t  =  1,  2,  non-random;  here  the  noises  {W/,  t  >  0}  and  {Wt2,  t  >  0}  are  independent 

Brownian  motions. 

Data  is  observed  continuously  and  at  each  instant  in  time,  each  decision-maker  can  either  declare  one  of 
the  hypotheses  to  be  true  or  continue  collecting  data.  In  either  case,  they  base  their  individual  decisions  on 
the  data  collected  by  their  own  sensors  up  to  that  time;  they  do  not  communicate  with  each  other  and  so  do 
not  share  information.  The  decisions  are  selected  to  minimize  a  joint  cost  function  with  two  components,  the 
first  one  capturing  the  cost  for  collecting  data,  and  the  second  assessing  the  cost  for  incorrect  decisions.  This 
is  the  simplest  problem  of  its  type,  for  the  coupling  between  the  two  decision-makers  occurs  only  through 
the  cost  structure.  This  problem  was  considered  first  in  discrete-time  by  Teneketzis  [6]  who  showed  that  the 
person-by-person  optimal  strategy  was  of  threshold  type  for  each  sensor.  Here  a  similar  result  is  derived  by 
simple  and  direct  arguments  based  on  well-known  facts  for  the  single  detector  problem.  Moreover,  explicit 
formulae  are  derived  for  this  joint  cost  function  when  the  detector  policies  are  of  threshold  type,  owing  to 
the  fact  that  at  the  decision  times,  the  likelihood  functionals  assume  one  of  two  threshold  values  owing  to 
the  continuity  of  the  paths  of  Brownian  motion.  This  is  in  sharp  contrast  with  the  overshoot  phenomena 
that  leads  in  the  discrete-time  situation  to  the  celebrated  Wald  approximations.  These  explicit  formulae 
not  only  vividly  display  the  cost  interaction  taking  place  between  the  two  sensors  but  readily  allow  for  a 
reduction  of  the  original  problem  to  a  mathematical  programming  problem  in  four  variables  over  a  simple 
constraint  set. 
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1.  INTRODUCTION: 


Consider  the  following  sequential  testing  of  two  simple  hypotheses  Ho  and  Hi  with  two  decision-makers: 
Decision-maker  t  is  equipped  with  its  own  sensor  and  observes  the  increments  dX\  of  the  stochastic  process 
X1  =  {XI,  t  >  0}  with  values  in  JR.  Under  each  hypothesis,  the  observed  data  is  the  output  of  a  stochastic 
dynamical  system,  i.e., 

Under  Hi  :  dX\  =  Hidt+  cr,  dW t‘,  i  =  1, 2 

Under  H0  :  dX\  =  dW't,  i  =  1,  2, 

where  Hi  ^  0  and  <Ti  ^  0,  i  =  1,2,  are  non-random.  Here  the  noises  {W±  ,  t  >  0}  and  {Wf ,  t  >  0}  are 
independent  Brownian  motions.  As  usual,  hypotheses  Hi  and  H0  correspond  to  the  “signal  in  noise”  and 
“noise  only”  situations  extensively  treated  in  the  literature  on  signal  detection. 

Data  is  observed  continuously  starting  at  an  initial  time  which  is  taken  for  convenience  to  be  zero. 
At  each  time  t  >  0,  each  decision  maker  can  either  declare  one  of  the  hypotheses  to  be  true  or  continue 
collecting  data.  In  either  case,  the  decision  makers  base  their  individual  decisions  on  the  data  collected  by 
their  own  sensors  up  to  that  time;  they  do  not  communicate  with  each  other  and  so  do  not  share  information. 
These  decisions  are  made  in  some  “optimal”  fashion  in  that  they  minimize  a  joint  cost  function  with  two 
components,  the  first  one  capturing  the  cost  for  collecting  data  and  the  second  assessing  the  cost  for  incorrect 
decisions.  This  is  the  simplest  problem  of  its  type,  for  the  coupling  between  the  two  decision-makers  occurs 
only  through  the  cost  structure.  Such  a  problem  was  considered  first  in  discrete-time  by  Teneketzis  [6]  who 
showed  that  the  person-by-person  optimal  strategy  was  of  threshold  type  for  each  decision-maker.  Here  a 
similar  result  is  derived  by  simple  and  direct  arguments  based  on  well-known  facts  for  the  single  detector 
problem. 

The  advantages  of  the  continuous-time  formulation  over  the  discrete— time  one  lie  in  the  following 
features:  Explicit  formulae  can  be  derived  for  the  joint  cost  function  when  the  detector  policies  are  of 
threshold  type.  This  follows  from  the  fact  that  at  the  decision  times,  the  likelihood  functionals  assume 
one  of  two  threshold  values  owing  to  the  continuity  of  the  paths  of  Brownian  motion.  This  is  in  sharp 
contrast  with  the  overshoot  phenomena  that  leads  in  the  discrete-time  situation  to  the  celebrated  Wald 
approximations.  These  explicit  formulae  not  only  vividly  display  the  cost  interaction  taking  place  between 
the  two  sensors  but  readily  allow  for  a  reduction  of  the  original  problem  to  a  mathematical  programming 
problem  in  four  variables  over  a  simple  constraint  set. 

The  paper  is  organized  as  follows:  The  precise  problem  formulation  is  given  in  Section  2,  the  single- 
detector  problem  is  presented  in  Section  3,  and  the  results  described  in  Section  4.  The  technical  discussion 
is  contained  in  Sections  5  and  6,  each  section  being  devoted  to  the  proof  of  a  main  result.  A  useful  technical 
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lemma  is  given  in  the  appendix  for  completeness. 
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2  PROBLEM  FORMULATION: 


In  this  paper,  a  Bayesian  formulation  is  adopted  for  the  problem  loosely  described  in  the  introduction. 
Let  (O,  J,  P)  be  a  probability  triple  carrying  a  {0,  l}-valued  random  variable  (RV)  H  and  two  JR-valued 
processes  {Wt\  t  >  0}  and  {Wf,  t  >  0}.  Throughout  the  discussion,  it  is  assumed  that  under  P, 

(Al):  The  RV  H  and  the  processes  {W/,  t  >  0}  and  {W(,  t  >  0}  are  mutually  independent, 

(A2):  Each  process  {lVt*,  t  >  0},  i  =  1, 2  ,  is  a  standard  Brownian  motion,  and 
(A3):  For  some  A  ,  0  <  A  <  1  , 

P[H  =  1}  =  A  =  1  —  P[H  =  0].  (2.1) 

The  JR-valued  processes  {X/,  t  >  0}  and  (X2,  t  >  0}  are  defined  as  outputs  of  the  dynamical  system 


or  equivalently, 


dX\  =  H  m  dt  +  cTidW}, 

X'0  =  0 


*  =  1,2, 


X't  =  H  mt  +  cri  W't,  t>  0,  t  =  1,  2, 


(2.2) 


(2.3) 


where  Hi  0  and  o\  ^  o  axe  nonrandom. 

To  fix  the  notation,  let  {7tw  ,  t  >  0}  and  {Jf  ,  t  >  0}  be  the  (P-completion  of)  the  filtrations  generated 
by  the  processes  {[Vtl,  t  >  0}  and  (XJ,  t  >  0},  respectively.  Let 


9i~7tx'vo{H),  t  >  0 


(2.4) 


with 

5<  ■=  V  51  (2.5) 

t>o 

The  decision  policy  for  decision-maker  i  ,  i  =  1, 2,  involves  the  selection  of  a  termination  time  rt-  and  of  a 
binary  valued  decision  8{,  and  an  admissible  decision  policy  for  decision-maker  i  is  thus  any  pair  7 ,•  =  (tv,  8{) 
of  RV’s,  where  rv  is  an  7X  -stopping  time  and  6i  is  an  7. X  -measurable  {0,  l}-valued  RV.  Denote  by  Tv  the 
collection  of  all  admissible  decision  policies  for  decision-maker  i  and  let  F  :=  Ti  X  T2- 

Let  fci  and  &2  t>e  positive  constants  and  let  C  be  a  mapping  {0, 1}  x  {0, 1}  X  {0, 1}  — ►  JR+.  For  every 
pair  7  =  (71,^2)  in  T,  define  the  cost  function  J( 7)  to  be 

J (7)  -  E\  kin  +  fc2r2  +  C{6u62]  H)  ].  (2.6) 

The  problem  (P)  investigated  in  this  paper  is  then 
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(P):  Minimize  Jx^)  overT. 

An  admissible  policy  7*  in  T  is  said  to  solve  problem  (P)  if 

J(l*)  <  J(l)-  for  dl  7  in  T.  (2.7) 

The  purpose  of  this  paper  is  to  identify  the  structure  of  an  optimal  policy. 
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3.  THE  SINGLE-DETECTOR  PROBLEM 


Consider  the  following  single-detector  sequential  hypothesis  testing  problem  defined  say  on  some  underly¬ 
ing  probability  triple  (fl,  /,  P).  A  decision-maker  observes  the  increments  of  a  stochastic  process  {Xt,  t>  0} 
with  values  in  1R.  Under  each  hypothesis,  the  observed  data  include  a  signal  which  is  the  output  of  a  stochas¬ 
tic  dynamical  system,  i.e., 

dXt  —  (mH  dt  +  a  dWt  (3.1) 

where  as  before  0  and  <7^0  are  nonrandom,  and  {Wt,  t  >  0}  is  a  standard  Brownian  motion. 

Data  is  observed  continuously  starting  at  an  initial  time  which  is  taken  for  convenience  to  be  zero.  At 
each  time  t  >  0,  the  decision  maker  can  either  declare  one  of  the  hypotheses  to  be  true  or  continue  collecting 
data.  Decisions  are  made  so  as  to  optimize  a  cost  function  with  two  components,  the  first  one  capturing  the 
cost  for  collecting  data,  and  the  second  one  assessing  the  cost  for  incorrect  decisions. 

More  formally,  let  {7X ,  t  >  0}  be  the  (P-completion  of  the)  filtration  generated  by  the  process  (Xt,  t  > 
0}.  An  admissible  policy  is  any  pair  7  =  (r,  5)  of  RV’s  where  t  is  an  ^-stopping  time  and  5  is  a  {0,  l}-valued 
fx  -measurable  RV.  The  collection  of  all  admissible  policies  is  denotes  by  T».  The  cost  corresponding  to 
any  7  in  T,  is  given  by 

J(75  k,  c)  =  E\  kr  +  c  1[6  H\  \  (3.2) 

The  problem  faced  by  the  single  detector  is  the  to  find  7*  in  T,  such  that 

J(7*;fc,c)  <  J(7;fc,c)  (3.3) 


for  all  other  7  in  T, . 

Let  {nt,  t  >  0}  be  the  ^-adapted  process  defined  by 

:=  P[H=  1  |  7tx],  t>  0.  (3.4) 

Note  that  nt  is  the  posteriori  probability  of  hypothesis  H 1  given  the  observations  7X . 

Theorem  3.1.  The  admissible  decision  policy  7*  =  ( r* ,8 *)  given  by 


is  optimal  in  the  sense  of  (S.S). 
equations 


t *  =  inf{t  >  0  |  7Tt  7^  ( A *,  B*)}, 

c.  f  1,  >  B\ 

\o,  7 rr.  <A*, 


(3.5a) 

(3.56) 


The  constants  A *  and  B *  are  the  unique  solutions  to  the  transcendental 


2c 


2/ccr2 

M2 


(¥'(j4*)  -  $'(£*)) 


(3.6) 


6 


c(  1  -  5*)  =  cA *  +  (B*  -  A*)( c  - 


(3.7) 


ka1 

H' 

where  =  (1  —  2x)  log(x/(l  —  a:)). 

Proof:  See  (  [5],  Theorem  5,  p.185). 

It  is  further  known  (5)  that  for  all  admissible  policies  of  the  form  (3.5)  with  constants  (A,  B)  satisfying 
0<A<1<5<OO  and  5,  the  relations 

2  2 

E\t  I  H  =  !]  =  2^2  w(P>a)>  E[r\H  =  0}  =  2^w(a^)  (3.8) 

H  n 


hold  with 

1  —  jj  2 

w(x,  y)  =  (1  -  x)  log (  —  - )  +  X  log(  YT^)>  0  <  X,  y  <  1, 

and 

(1-5)(A-A)  A(5-A) 

(B  —  A)(l  —  A)  ’  P  \(B  —  A) ' 


(3.9) 


(3.10) 
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4.  THE  RESULTS: 


Throughout  the  discussion,  the  decision  cost  satisfies  the  natural  conditions 


C(m,  n;  n)  >  C{n,  n;  n) 

(4.1) 

C(m,  m;  n)  >  C(n,  m\  n) 

(4.2) 

C (m,  n;  h)  —  C(n,m\h) 

(4.3) 

for  all  n  ,  m  ,  h  in  {0, 1},  with  m  ^  n.  In  other  words,  it  always  cost  more  to  make  an  error  than  it  does 
to  make  the  correct  decision,  regardless  of  which  decision  is  taken  by  the  other  decision-maker.  As  readily 
checked,  there  is  no  loss  in  generality  in  assuming  zero  cost  for  correct  decisions,  i.e., 

C(1,1;1)  =  C(0>0;0)  =  0,  (4.4) 


an  assumption  enforced  from  now  on. 

To  state  the  results  of  this  paper,  it  is  convenient  to  introduce  the  Jx  -adapted  processes  {n't,  t  >  0} 
and  t  >  0},  i  =  1, 2,  where 


<  :=  P\H  =  1|^X<],  t>0,  (4.5) 

and 

4>\  :=  exp(  ^{X\  -  i//<)]>  *  >  0-  (4.6) 

ai  Z 

For  each  decision-maker,  a  policy  7,-  =  (tv,  5,)  in  I\,  t  =  1,  2,  is  said  to  be  of  threshold  type  if  there 
exists  constants  A,-  and  Bt,  with  0  <  Ai  <  B{  <  1,  such  that 


and 


n  ••=  inf {t  >  0  :  7 rj  ^  (A,-,  £<)} 

f  1  if  xit  >  Bi 


Si  := 


0  if  Tri  <  Ai. 

•x  — 


(4.7) 


(4.8) 


As  is  well  known  [5,  p.  181],  the  relation 


x,  = 


A  4 


t  >  0, 


(1  —  A)  +  A  4>\ 

holds.  Therefore,  any  threshold  policy  for  decision-maker  i  with  constants  (A,-,  Bi)  takes  the  form 


(4.9) 


TV  :=  inf  (t  >  0  :  <f>\  <£  (ai;  6;)} 


(4.10) 
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and 


where  the  new  thresholds  a,-  and  6;  are  given  by 


(4.11) 


&)(&)■ 


(4.12) 


From  now  on,  a  threshold  policy  7,-  in  I\-  will  be  described  in  the  form  (4.10)-(4.11)  and  will  thus  be 
identified  with  two  threshold  constants  a ,  and  ,  0  <  a,-  <  1  <  a,-  ^  6,-.  Let  7;  be  the  collection  of  all 

threshold  policies  in  I\-  for  decision-maker  t,  with  the  notation  T  =  Ti  X  T2. 


THEOREM  4.1:  Under  the  foregoing  assumptions, 


(4.13) 


i.e.,  only  threshold  policies  need  to  be  considered  in  solving  problem  (P). 

For  simplicity  of  exposition,  the  discussion  is  carried  out  under  the  additional  assumption  that  the 
decision  cost  has  the  form 

{0  for  d±  =  d-2  —  h 

e  for  di  d-i  (4.14) 

/  for  di  =  d2  7^  h. 

The  general  case  can  be  handled  in  a  similar  way  but  the  calculations  are  more  involved.  Details  are  worked 
out  in  a  lengthier  version  of  this  paper  [3], 

Let  7!  and  72  be  threshold  policies  with  parameters  (ai,i>i)  and  (a2,  b2),  respectively,  and  pose 


Oi  =  p[*  =  i|fr  =  o],  ft  =  p[ft  =  o|fr  =  i],  t  =  1, 2, 


(4.15) 


the  so-called  error  probabilities  of  the  first  and  second  kind,  respectively,  or  in  radar  parlance,  the  false 
alarm  and  miss  probabilities. 

THEOREM  4.2:  Under  the  foregoing  assumptions,  any  pair  of  threshold  policies  71  and  72  with  parameters 
(di,&i)  and  (a2,62),  respectively,  incurs  a  cost  J( 7)  given  by  the  expression 

J(ri)  =  j{ 7i;fci>e)  +  J{ 72;fc2,e)  4-  (/  -  2e)[\PiP2  +  (1  -  A)aia2]  (4.16) 

where  J(t,  k,  c)  is  the  function  defined  in  (S.S). 

Together  Theorems  4.1  and  4.2  thus  reduce  the  search  for  a  solution  7*  to  problem  (P)  to  a  mathematical 
programming  problem  in  the  variables  (07,  /?i,  a2,  P2)  over  a  simple  constraint  set. 

THEOREM  4.3:  Under  the  foregoing  assumptions,  problem  (P )  is  solved  by  a  pair  7*  —  (71,72)  of 
threshold  policies  in  T  . 
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5.  A  DISCUSSION  OF  THEOREM  4.1. 


The  discussion  given  below  hinges  upon  arguments  very  similar  to  the  ones  proposed  by  Teneketzis  in 
the  discrete-time  case  [6],  as  would  be  expected.  Here  too,  the  basic  idea  lies  in  switching  attention  from 
optimality  for  problem  (P)  defined  in  (2.7)  to  person-by-person  optimality,  as  understood  in  the  literature 
on  multi-agent  decision-making.  Under  the  strong  independence  assumptions  made  here,  this  procedure 
leads  to  a  decoupling  of  the  two  decision-makers.  Some  of  the  details  underlying  this  line  of  reasoning  are 
presented  in  the  next  two  technical  lemmas. 

For  every  admissible  policy  72  in  T2>  define  the  mapping  C1a:  {0, 1}  X  {0, 1}  — +  M  by 

£7i(duh)  :  =  E[C{du62-h)\H=h) 

=  C{du  1;  h)  P[S2  =  1 1  H  =  h]  +  C(d!,0;  h)  P[S2  =  0\H  =  h ]  (5.1) 

for  all  [di,h)  in  {0, 1}  x  (0, 1}.  The  conditions  (4.1)-(4.2)  readily  imply  that  this  mapping  satisfies  the 
condition 

C^(Suh)>C^(h,h).  (5.2) 

for  all  (di,  h)  in  {0,  1}  X  {0,  1}. 

Lemma  5.1:  Under  the  foregoing  assumptions,  the  a-fields  Qx  and  Cf2  are  conditionally  independent 
given  H . 

Proof:  This  readily  follows  from  the  assumptions  (Al),  (2.3)  and  the  obvious  cr-field  identity 

9i  =  7txi  V  <r{H)  =  T™iV  o{H),  t>  0.  □ 

Lemma  5.2.  With  the  notation  (5.1),  the  relation 

E[C(51,S2]H))  =  E[C1,(S1,H)}  (5.3) 

holds  true  for  every  pair  7  =  (71,72)  in  T. 

Proof:  By  standard  properties  of  conditional  expectations,  the  equalities 

E[C(5u52;  H)}  =  E\  E\C(S1,S2 ;  H  \  Qx)  ]  (5.4) 

=  E{  E[C(di,  82\  h)  |  p1]<i1=«1,h=H  ]  (5-5) 

are  readily  obtained.  From  Proposition  A.l  of  the  appendix  and  from  Lemma  5.1,  it  follows  that 

E\C(du  S2- h)  |  Q1}  =  E[C(du62]  h)  |  a(Il) ]  (5.6) 
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for  all  (di,h)  in  {0,1}  X  {0,1}.  Substitution  of  this  last  relation  into  (5.5)  and  use  of  the  definition  (5.1) 
now  yield  the  result.  □ 

Proof  of  Theorem  4.1: 

With  7  =  (71,72)  in  T,  Lemma  5.2  implies  that 

J(7)  =  E\k1r1  + k2T2  + 0^,(6,.,  H)] 

=  k^E[  r2]  +  «Ly,(7i),  (5.7) 

where 

*Pra(7i)  =  E[  ^iri  +  C7,(5i,  H)  ].  (5.8) 

The  sequential  hypothesis  problem  (P/72)  faced  by  decision  maker  1,  with  72  fixed,  takes  the  form 

(P/72)  Minimize  J~,^( 71)  over  Fi. 

Under  the  assumed  conditions,  this  problem  is  a  single-detector  sequential  hypothesis  testing  problem  with 
decision  cost  of  the  type  discussed  in  Section  3.  Here  k  —  ki  and  c  =  e.  As  pointed  out  in  Theorem  3.1, 
the  admissible  policy  7J  in  that  solves  problem  (P/72)  is  of  threshold  type  ([5],  Theorem  5,  p.  185). 
Therefore,  for  every  pair  7  in  F,  there  exists  a  threshold  policy  71  in  7i  such  that 

^72(71)  5:  (5-9) 

or  equivalently, 

^(71.72)  <  ^(71,72)  =  ^(7)-  (5.10) 

By  symmetry,  interchanging  the  role  of  71  and  72,  there  exists  a  threshold  policy  72  in  T2  such  that 

•^(71.72)  <  ^(7)-  (5.11) 

This  last  step,  when  applied  with  the  pair  (71,72)  (instead  of  (71,72))  yields  the  existence  of  a  threshold 
policy  72  in  T  such  that 

^(71,72)  <  ^(71.72)-  (5.12) 

This  shows  that  to  every  admissible  policy  7  in  F,  there  corresponds  an  admissible  policy  7  in  T  such  that 

J( 7)  <  J{ 7),  (5.13) 

and  the  result  of  Theorem  4.1  is  now  immediate  since  T  C  T.  □ 
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6.  Proof  of  Theorem  4.2: 


Let  'i  be  an  admissible  policy  T.  By  elementary  calculations,  the  corresponding  cost  is  given 
/(•y)  =  E^ k\Ti  +  A;2r2  +  C(5i,  S2 ;  H)  j 

=  +  k2r2  +  C(  1, 82]  H)  P[6i  =  l  \  H \  +  C(0, 82 ;  H)  (1  —  P[5i  =  1 1  H ])  j 

=  E[kin  +  k2T2  +  C{  1, 1;  H)  P{81  =  1  |  H }  P[S2  =  1  |  H] 

+  C(l,0-,H)P[61  =  l\H\  P{82  =  0\H] 

+  C(0,l;H)(l-P{81  =  l\H})P\62  =  l\H) 

+  C(0,0;H)  (1  -  P[fi!  =  1  |  H})(l-P[82  =  1])] 

=  fci 7-j  +  k2r2  j  +  A  jC(l,  1;  1)(1  —  ^i)(l  —  P2)  +  C(l,  0;  1)(1  —  Pi)fh 

+  C( 0, 0;  1)^(1  -  p2)  +  C(0, 0;  1  )ft/?2] 

+  (1  -  A)  [c(l,  l;0)a1O'2  +  C(l,0;0)ai(l  -  a2) 

+  C{ 0,  1;0)(1  -  <*i )a2  +  C(0,0;0)(l  -  c*i)(l  -  a2)] 

=  E  +  k2r2  j  +  A  je(/?i  +  @2 )  +  {k  ~  2e)/?i/?2J 

+  (1  -  A)  ^e(ai  +  <*2)  +  (A:  —  2e)aia2J 

where  aj  and  $  were  defined  in  (4.16). 

In  order  to  compute  the  first  expectation  term  in  (6.5),  it  is  convenient  to  observe  that 

JsM  =A jE?[r<  |//=l]  +  (1-A)#[n  |  JET  =  0],  t  =  l,2 

thus  reducing  the  final  step  of  the  computation  to  the  evaluation  of  the  conditional  expectations 


whose  expressions  are  given  in  (3.8).  The  result  of  Theorem  4.2  now  follows. 


7.  CONCLUSIONS: 


Based  on  Theorem  4.2  an  explicit  solution  can  be  computed  for  the  thresholds  of  each  agent  once  the 
parameters  of  the  problem  are  given  (ki,  k%,  e,  /).  The  mathematical  programming  problem  described  in 
Theorem  4.2  is  readily  seen  to  have  a  unique  solution,  and  can  be  solved  by  a  variety  of  methods,  say  a 
Newton-like  scheme.  By  solving  (6.5)  for  the  constants  (ai,  fii,  a2,  ^2),  the  thresholds  are  easily  determined 
via  the  relations  (3.10)  and  (4.12). 
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APPENDIX 


On  some  probability  triple  (Q,7,P),  consider  three  cr-fields  7\,  7%  and  /a,  and  X  be  an  integrable 
JR-valued  RV. 

Proposition  A.l:  If  the  cr-fields  7i  and  7i  are  conditionally  independent  given  the  cr-field  7s,  then 

E[X\72V73}  =  E[X\T3}  P-a.s.  (Al) 

whenever  X  is  an  Immeasurable  RV. 

Proof:  By  virtue  of  the  Monotone  Class  Theorem,  it  suffices  to  establish  (A.l)  for  any  bounded  7) -measurable 
RV.  To  that  end,  let  F 3  and  Fs  be  arbitrary  elements  of  the  cr-fields  7i  and  7s,  respectively.  By  the  very 
definition  of  conditional  expectation, 

E[  IfjOFj  E[X  I  7-2  V  J3  ]  ]  =  E[  ljr3  x  1^  ]  (A2) 

=  E[lFiE[lFjX\73}\  (A3) 

=  E[lFiE\lFl\7s}E[X\7s\},  (A4) 

where  the  last  equality  was  obtained  by  making  use  of  the  conditional  independence  of  the  cr-fields  7\  and 
7-z  given  7s .  The  ^-measurability  of  the  RV’s  If,  and  E\X  )  7s )  now  readily  yields 

E[lFiE\\F,  \7s}E\X\7s]}  =  E[E[lFsE\X\7s\lF, \7s}\  (A5) 

=  E[lF,nF3E[X\7s}}  (A6) 

via  the  smoothing  property  for  conditional  expectations. 

Consequently, 

■S’!  nFj  E[X  \  I2V  7s}}  =  E\lF,nF,  E\X  \  73  \  ]  (A7) 

and  the  result  (Al)  immediately  follows  by  standard  arguments.  □ 
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